My Notes

Created: 2026-03-06 07:53:04

Updated: 2026-03-06 07:53:04

3.1 The AEP

The asymptotic equipartition property is formalized in the following theorem:
Theorem 3.1.1: If $X_{1},X_{2},\dots$ are i.i.d $\sim p(x)$ , then

$-\frac{1}{n}\log p(X_{1},X_{2},\dots,X_{n})\to H(X) \qquad\text{in probability}$

proof: 独立随机变量的函数依然是独立随机变量，因此

$\begin{align} -\frac{1}{n} \log p(X_{1},X_{2},\dots,X_{n}) & =-\frac{1}{n}\sum_{i}\log p(X_{i}) \\ & \to -E\log p(X) & \text{in probability} \\ & =H(X) \end{align}$

定义： $p(x)$ 的典型集合(typical set) $A_{\epsilon}^{(n)}$ 是满足如下条件的序列 $(x_{1},x_{2},\dots x_{n})\in\mathscr{H}^{n}$ ：

$2^{-n(H(X)+\epsilon)}\leq p(x_{1},x_{2},\dots,x_{n})\leq 2^{-n(H(X)-\epsilon)}$

由于AEP的性质，我们可以得出 $A_{\epsilon}^{(n)}$ 的如下性质：

$(x_{1},x_{2},\dots,x_{n})\in A_{\epsilon}^{(n)}\implies H(X)-\epsilon\leq-\frac{1}{n}\log p(x_{1},x_{2},\dots,x_{n})\leq H(X)+\epsilon$
proof:
从 $A_{\epsilon}^{(n)}$ 的定义可直接得出。
$\text{Pr}\{A_{\epsilon}^{(n)}\}>1-\epsilon$ for $n$ sufficiently large
proof:由Theorem 3.1.1，

$\text{Pr}\left\{ \mid-\frac{1}{n}\log p(X_{1}X_{2}\dots X_{n})-H(X)\mid <\epsilon\right\}\to1\qquad\text{in probility}$
因此任意 $\delta$ , $\exists n_{0},\text{such that} \forall\ n\geq n_{0}$ 时，我们有

$\text{Pr}\left\{ \mid-\frac{1}{n}\log p(X_{1},X_{2},\dots,X_{n})-H(X)\mid <\epsilon\right\}>1-\delta$
令 $\delta=\epsilon$ 即得证。
$\mid A_{\epsilon}^{(n)}\mid\leq 2^{n(H(X)+\epsilon)}$
$\mid A_{\epsilon}^{(n)}\mid \geq (1-\epsilon)2^{n(H(X)-\epsilon)}$ ，对充分大的 $n$ 成立。

Theorem 3.2.1: Let $X^n$ be i.i.d. $\sim p(x)$ . Let $\epsilon>0$ , then $\exists$ code which maps sequences $x^n$ of length $n$ into binary strings such that the mapping is one-to-one(therefore invertible) and

$E\left[ \frac{1}{n}l(X^n) \right]\leq H(X)+\epsilon$

for n sufficiently large.

3.3 High probability sets and the typical set

Let $B_{\delta }^{(n)}\subset \mathscr{H}^n$ be the any set such that $\text{Pr}\{B_{\delta}^{(n)}\}\geq 1-\delta$ . We argue that $B_{\delta}^{(n)}$ must be significant intersection with $A^{(n)}_{\epsilon}$ and therefore must have about as many elements.

Theorem 3.3.1:
Let $X_{1},X_{2},\dots X_{n}$ be i.i.d ~ $p(x)$ . For $\delta< \frac{1}{2}$ and any $\delta'<0$ , if $Pr\{B^{n}_{\delta}\}>1-\delta$ , then

$\frac{1}{n} \log|B^{(n)}_{\delta}| > H-\delta'\qquad\text{for n sufficiently large}$

3.1 The AEP

3.3 High probability sets and the typical set

Leave a Comment